A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration

Authors

Steven van de Par

Armin Kohlrausch

Richard Heusdens

Jesper Jensen

Søren Holdt Jensen

Abstract

Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoidal modelling of audio signals. In this paper, we present a new perceptual model that predicts masked thresholds for sinusoidal distortions. The model relies on signal detection theory and incorporates more recent insights about spectral and temporal integration in auditory masking. As a consequence, the model is able to predict the distortion detectability. In fact, the distortion detectability defines a (perceptually relevant) norm on the underlying signal space which is beneficial for optimisation algorithms such as rate-distortion optimisation or linear predictive coding. We evaluate the merits of the model by combining it with a sinusoidal extraction method and compare the results with those obtained with the ISO MPEG-1 Layer I-II recommended model. Listening tests show a clear preference for the new model. More specifically, the model presented here leads to a reduction of more than 20% in terms of number of sinusoids needed to represent signals at a given quality level.
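The abstract's claim that distortion detectability defines a norm on the signal space can be illustrated with a minimal sketch. The abstract does not give the model's formula, so the code below assumes a generic form: detectability as a weighted sum of squared distortion-spectrum magnitudes. The function names and the `weights` values are hypothetical placeholders for whatever masker-dependent weighting a perceptual model would supply; this is not the authors' model.

```python
import math

def detectability(distortion_spectrum, weights):
    """Weighted squared norm: sum over bins of w(f) * |eps(f)|^2.

    `weights` is a hypothetical per-bin perceptual weighting (assumed
    non-negative); a real model would derive it from the masking signal.
    """
    if len(distortion_spectrum) != len(weights):
        raise ValueError("spectrum and weights must have the same length")
    return sum(w * abs(e) ** 2 for e, w in zip(distortion_spectrum, weights))

def perceptual_norm(distortion_spectrum, weights):
    """Square root of the detectability; a norm when all weights are positive."""
    return math.sqrt(detectability(distortion_spectrum, weights))

# Two toy distortion spectra over two frequency bins (illustrative values).
weights = [1.0, 4.0]
x = [3, 0]
y = [0, 2j]
x_plus_y = [a + b for a, b in zip(x, y)]

# Triangle inequality, one of the properties that makes this a norm.
assert perceptual_norm(x_plus_y, weights) <= (
    perceptual_norm(x, weights) + perceptual_norm(y, weights)
)
```

Because such a measure is quadratic in the distortion, it can be used directly as the cost in least-squares style optimisation, which is presumably why the abstract highlights rate-distortion optimisation and linear predictive coding as beneficiaries.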

Similar resources

Scalable audio coding employing sorted sinusoidal parameters

This paper describes the use of sorted sinusoidal parameters to produce a fixed rate, scalable, wideband audio coder. The sorting technique relies on the perceptual significance of the sinusoidal parameters. Sinusoidal coding permits the representation of a given signal through the summation of sinusoids. The parameters of the sinusoids (the amplitudes, phases and frequencies) are transmitted t...

Audio coding using sorted sinusoidal parameters

This paper describes a new audio coding scheme based on sinusoidal coding of signals. Sinusoidal coding permits the representation of a given signal through the summation of sinusoids. The parameters of the sinusoids (the amplitudes, phases and frequencies) are transmitted to allow the signal reconstruction. In the proposed scheme, the sinusoidal parameters are sorted according to energy conten...

Linear prediction of audio signals

Linear prediction (LP) is a valuable tool for speech analysis and coding, due to the efficiency of the autoregressive model for speech signals. In audio analysis and coding, the sinusoidal model is much more popular, which is partly due to the poor performance of audio LP. By examining audio LP from a spectral estimation point of view, we observe that the distribution of the audio signal’s domi...

Perceptual audio modeling with exponentially damped sinusoids

This paper presents the derivation of a new perceptual model that represents speech and audio signals by a sum of exponentially damped sinusoids. Compared to a traditional sinusoidal model, the exponential sinusoidal model (ESM) is better suited to model transient segments that are readily found in audio signals. Total least squares (TLS) algorithms are applied for the automatic extraction of t...

Fractal Sinusoidal Modelling For Low Bit-Rate Audio Coding

This paper proposes a fractal sinusoidal model that is able to reduce the bit-rate of sinusoidal model coders while achieving perceptually lossless quality. This is achieved by removing the redundancy between sinusoidal tracks through encoding similar tracks with the transformation between a template track and the original track. This paper proposes a transform that is able to capture the perce...

Journal:

EURASIP J. Adv. Sig. Proc.

Volume 2005, Issue (not listed)

Pages: -

Publication date: 2005
